Utilizing text mining results: The Pasta Web System

Authors

  • George Demetriou
  • Robert J. Gaizauskas
Abstract

Information Extraction (IE), defined as the activity of extracting structured knowledge from unstructured text sources, offers new opportunities for the exploitation of biological information contained in the vast amounts of scientific literature. But while IE technology has received increasing attention in the area of molecular biology, there have not been many examples of IE systems successfully deployed in end-user applications. We describe the development of PASTAWeb, a WWW-based interface to the extraction output of PASTA, an IE system that extracts protein structure information from MEDLINE abstracts. Key characteristics of PASTAWeb are the seamless integration of the PASTA extraction results (templates) with WWW-based technology, the dynamic generation of WWW content from ‘static’ data, and the fusion of information extracted from multiple documents.

Similar resources

Designing a System for Trend Analysis of Users in Website Surfing in Iran Using Data Mining and Text Mining Algorithms

Background and Aim: As web surfing has entered the lifestyle of a vast majority of people in society, and given the need for more accurate social and cultural policy making in the field, the authors set out to analyze the behavior of users in viewing different websites so as to help politicians and practitioners. Methods: The design science research method is used in this research...

Prediction of user's trustworthiness in web-based social networks via text mining

In social networks, users need a proper estimate of trust in others to be able to establish reliable relationships. Some trust evaluation mechanisms have been offered, which use direct ratings to calculate or propagate trust values. However, in some web-based social networks where users only have binary relationships, there is no direct rating available. Therefore, a new method is required t...

Utilizing ARM Technique in Mining Textual Data

Text mining, as one major school in Knowledge Discovery in Data (KDD), mines hidden patterns, rules, regularities and trends from textual data / non-database data (i.e., text files, web documents, etc.). It is quite different from data mining (another well-known major school in KDD): the data structure of texts, dealt with by text mining, is considered implicit, whereas traditional database-data, de...

Protein Structures and Information Extraction from Biological Texts: The PASTA System

MOTIVATION The rapid increase in volume of protein structure literature means useful information may be hidden or lost in the published literature and the process of finding relevant material, sometimes the rate-determining factor in new research, may be arduous and slow. RESULTS We describe the Protein Active Site Template Acquisition (PASTA) system, which addresses these problems by perform...

Aswaacc Automatic Semantic Web Annotation by Applying Associative Concept Classifier in Text

Since the appearance of the semantic web, the machine-readable and machine-understandable framework proposed by Berners-Lee, the current web should be annotated according to W3C standards so that the semantic domain of each word is defined by its ontology, alleviating the problems posed in the realm of search and information retrieval. However, annotation is one major problem in the semantic web domain, which is present...

Publication date: 2002